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ABSTRACT 

For  past  three  decade  Autoregressive  Integrated  Moving  Average  Models  are  one  of  the  most  talked  about 
models  used  for  financial  prediction  in  time  -series  analysis.  This  paper  endeavors  to  present  the  forecasting  of  daily 
stock  data  of  State  Bank  of  India  (SBI)  for  a period  of  2 years.  Box-Jenkins  methodology  of  developing  ARIMA  models 
is  used  for  forecasting  the  stock  prices.  The  results  reveals  that  ARIMA  (0,  7,  1)  is  the  most  suitable  model  found  for 
forecasting. 
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I.  INTRODUCTION 

Stock  Prediction  will  always  remain  an  interesting  area  for  the  researchers  to  explore.  It  has  gathered  the 
interest  of  the  researchers  to  develop  better  predictive  models  every  now  and  then.  Stock  price  prediction  is  an 
important  topic  in  finance  and  economics.  The  financial  institutions  and  individual  investors  need  an  effective 
strategy  to  take  decision  everyday  on  the  basis  of  predictions.  The  price  prediction  is  regarded  as  one  of  most 
difficult  task  to  achieve  in  financial  forecasting  due  to  complex  nature  of  stock  market  [1,  2,  3]  investment  risk 
from  the  stock  market.  This  remains  an  inspiring  aspect  for  scholars  to  evolve  new  predictive  models  [4]. 

Till  date  lot  many  models  and  techniques  had  been  advanced  to  predict  stock  prices.  Among  them 
ARIMA  models  are  the  most  popular  statistical  tools.  As  it  is  quoted  in  the  literature  now  a days  forecasting  can 
be  done  on  the  basis  of  two  different  viewpoints,  one  on  the  basis  of  Statistical  Techniques,  and  other  on  the  basis 
of  Artificial  Intelligence  Technique  [2].  ARIMA  models  are  known  to  be  very  efficient  for  forecasting  particularly 
for  short-term  prediction  in  the  field  of  financial  time  series  than  even  the  most  popular  ANNs  techniques 
[5,  6,  7]. Other  statistics  models  are  regression  method,  exponential  smoothing,  generalized  autoregressive 
conditional  heteroscedasticity  (GARCH).  Few  related  works  that  has  engaged  ARIMA  model  for  forecasting 
includes  [8,  9,  10,  11,  12,  13,  14,  15,  16,  17,  18]. 

This  paper  presents  extensive  process  of  building  ARIMA  models  for  short-term  price  prediction. 
The  results  obtained  from  real-life  data  established  the  potential  strength  of  ARIMA  models  to  provide 
stockholders  short-term  prediction  that  could  support  investment  decision  making  process. 

The  rest  of  the  paper  is  organized  as  follows.  Section  2 presents  brief  overview  of  Review  of  literature. 
Section  3 describes  ARIMA  model.  Section  4 presents  the  methodology  used  while  section  5 illustrates  the 
experimental  results  obtained.  The  paper  is  concluded  in  section  6. 
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II.  REVIEW  OF  LITERATURE 

Manoj  and  Edward  J.  (2016)  applied  ARIMAs  models  to  forecast  stock  prices  in  their  sector-specific  study  of 
forecasting  and  analyzing  Indian  sectoral  stock  prices.  Their  study  suggested  that  ARIMA  (1,  1,  0)  is  the  most  suitable 
model  to  be  used  for  forecasting  stock  prices.  It  suggested  that  the  stock  prices  are  upward  trends  and  could  be  considered 
as  a worthy  investment  if  examined  closely.  The  ARIMA  model  was  developed  on  the  stock  prices  of  6 sectors  viz. 
Banking,  Healthcare,  Automobiles,  Power,  Oil  & Gas  and  IT.  The  developed  models  have  one  common  characteristic  i.e. 
they  are  all  integrated  at  first  order  and  are  Autoregressive  models  with  lag  1 having  no  MA  characteristics.  The  data  used 
in  this  study  is  of  daily  observations  of  past  9 years  (Feb  2007  - April  2015)  with  1996  observations.11 

Adebiyi^  al.  (2014)  carried  out  research  on  stock  data  obtained  from  New  York  Stock  Exchange  (NYSE)  and 
Nigeria  Stock  Exchange  (NSE).  This  paper  presents  extensive  process  of  building  stock  price  predictive  model  using  the 
ARIMA  model.  The  data  used  in  the  research  are  historical  daily  stock  prices  taken  from  stock  exchanges  of  two  countries 
from  the  period  of  25th  April,  1995  to  25th  February,  2011  with  3990  observations.  The  results  obtained  from  the  paper 
using  ARIMA  model  shows  the  potential  of  ARIMA  models  to  predict  stock  prices  on  short-term  basis  satisfactorily.14 

Kwasi  and  Kobina  (2014)  described  modeling  and  forecasting  of  wholesale  cassava  monthly  prices  in  central 
region  of  Ghana  using  ARIMA  model.  The  experiment  carried  out  on  sample  forecasting  from  January  2013  to  December 
2013.  The  study  demonstrates  a good  performance  in  terms  of  explained  variability  and  predicting  power.10 

Vernwf  al.  (2015)  analyzed  ARIMA  and  state  space  modeling.  These  two  procedures  were  compared  for 
sugarcane  yield  modeling  and  forecasting  to  estimate  the  yield  of  sugarcane  crop  in  three  district  of  the  Haryana,  India. 
The  time  series  yield  data  from  1960-61  to  2006-07  of  sugarcane  crop  for  the  districts  under  consideration  compiled  from 
statistical  abstracts  of  Haryana.  The  Study  was  primarily  aimed  at  predicting  a future  value  on  the  basis.  Crop  yield 
forecasting  for  the  model  testing  period  i.e.  2002-03,2003-04,2004-05,2005-06,  2006-07  and  2007-08  were  achieved  to 
check  the  validity  of  the  models  developed  on  the  basis  of  sugarcane  yield  data  from  1960-61  to  2001-02.  ARIMA  and 
State  space  models  were  found  to  be  close  to  the  real  time  yields  and  both  the  procedure  were  considered  for  the 
operational  crop  yield  forecasting  purpose.  [8] 

T.  Ozaki  (1977):  Attempted  to  remove  the  difficulty  in  deciding  the  order  of  an  ARIMA 
(autoregressive  integrated  moving  average)  model  by  using  the  MAICE  (minimum  AIC  estimation)  procedure,  which 
selects  a model  by  using  Akaike's  Information  Criterion  (AIC).  The  Ale,  which  is  an  estimate  of  the  Kullback-Leibler 
information  quantity,  provides  a powerful  and  almost  automatic  procedure  for  the  identification  of  ARIMA  model.  It  is 
remarkable  that  if  we  restrict  ourselves,  as  was  suggested  by  B-J,  to  the  set  of  the  five  models  (1,  d,  0),  (2,  d,  0),  (0,  d,  1), 
(0,  d,  2)  and  (1,  d,  1),  the  MAICE  procedure  produces  almost  identical  results  as  those  obtained  by  the  B-J  procedure 
which  requires  highly  skillful  but  nevertheless  subjective  judgment. [13] 

Abdullahi,  and  Bakari  (2014)  examined  the  trend  or  pattern  in  the  Nigeria  capital  market,  as  well  as  how  to 
determine  a suitable  model  for  forecasting  the  Nigerian  stock  market  by  applying  ARIMA  model.  Among  the  series  of 
ARIMA  models  tested,  it  was  discovered  that  ARIMA  (2,  1,  2)  model  performs  best  since  it  has  a minimum  MAPE  and 
MAE  compared  with  the  other  models.  The  stock  data  used  for  this  paper  was  taken  from  the  period  of  1985  to  2008.  [12] 
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III.  ARIMA  MODEL 

ARIMA  model  was  first  introduced  by  Box  and  Jenkins  in  1970.  It  also  referred  to  as  Box- Jenkins  methodology 
comprised  of  set  of  activities  for  identifying,  estimating  and  diagnosing  ARIMA  models  with  time  series  data.  The  model 
is  one  of  the  most  protuberant  methods  in  financial  forecasting  [1,  6,  16].  ARIMA  models  have  shown  efficient 
competency  to  generate  short-term  forecasts.  It  constantly  outperformed  complex  structural  models  in  short-term 
prediction  [17].  In  ARIMA  model,  the  future  value  of  a variable  is  a linear  combination  of  past  values  and  past  errors, 
expressed  as  follows: 

Yt=0  0+  Oi  Yt-i  + 02  Yt-2  +...+  OpYt-p  + et  - 0i  et_i  - 02  et_2  - ...  - 0q  e t-q 

where, 

Yt  is  the  actual  value,  et  is  the  random  error  at  /,  Oj  and  0j  are  the  coefficients,  p and  q are  integers  that  are  often 
referred  to  as  autoregressive  and  moving  average,  respectively. 

The  steps  in  building  ARIMA  predictive  model  consist  of:  model  identification,  parameter  estimation  and 
diagnostic  checking  [18]. 

IV.  METHODOLOGY 

This  research  considered  daily  historical  stock  price  data  of  State  Bank  of  India  (SBI)  collected  from  Yahoo 
Finance  covering  the  period  starting  from  20  January2014to  20  January  2016.  In  all,  523  observations  were  used  for  fitting 
various  ARIMA  models.  On  31.10.2014  a split  of  shares  of  SBI  was  observed  and  data  was  adjusted  accordingly  to  make 
the  series  uniform  {on  web  page  this  notification  is  reflected  on  21st  November  2014  but  actual  split  on  Yahoo  Finance  is 
considered  on  31.10.14).  The  data  is  comprised  of  four  variables,  namely:  open  price,  low  price,  high  price  and  close  price 
respectively.  In  the  research,  open  price  of  the  data  represents  the  price  of  the  index  to  be  forecasted. 

The  method  used  in  the  study  to  develop  ARIMA  model  for  stock  price  prediction  is  explained  in  detail 
underneath. 

• The  original  data  taken  for  the  SBI  stock  is  split  in  the  ratio  1 : 10  for  a period  of  20. 01. 14  to  30.10.14. 

• Data  is  tested  for  stationarity  using  Dickey  Fuller  Test. 

• The  selected  data  was  found  non-  stationary  then  it  was  made  stationary  by  taking  first  difference  and  second 

difference  simultaneously. 

• Then  17  ARIMA  models  were  fitted  taking  p = [0,1,2],  q = [0,1,2]  and  d=  [0,1,2] 

• Interpretation  of  statistics  and  analysis  of  graphs  was  done  for  identifying  best  ARIMA  model. 

• Inferences  and  conclusions  were  drawn  accordingly. 

• To  determine  the  best  ARIMA  model  among  several  experiments  performed,  the  following  criteria  are  used  in  this 

study  for  each  stock  index. 

Figure  1 depicts  the  original  pattern  of  the  series  to  have  general  overview  whether  the  time  series  is  stationary  or 
not.  From  the  graph  below  the  time  series  have  random  walk  pattern. 
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It  is  evident  from  the  graph  above  that  the  series  is  non-stationary.  Therefore  Augmented  Dickey-Fuller  (ADF) 
unit  root  test  is  applied  on  opening  price  of  SBI  stock  at  BSE.  The  result  confirms  that  the  series  becomes  stationary 
after  the  first-  difference  of  the  data.  However  the  series  is  also  checked  at  the  second  difference  in  anticipation  of  getting 
better  results. 


Tablel:  Dickey-Fuller  Statistics 


Dickey-Fuller  Test 

Original 

First  Difference 

Second  Difference 

Tau  (Observed  value) 

-1.500 

-8.506 

-14.756 

Tau  (Critical  value) 

-0.903 

-0.907 

-0.907 

p-value  (one-tailed) 

0.826 

< 0.0001 

< 0.0001 

Here  p-value  (one-tailed)  < alpha  i.e.  < 0.0001  < 0.05.  So  the  series  is  now  stationary.  It  is  very  evident  from 
graph  below. 


Figure  2:  First  Difference  of  Opening  Price  of  SBI  Stock  at  BSE 


Figure  3:  Second  Difference  of  Opening  Price  of  SBI  Stock  at  BSE 
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Various  ARIMA  models  are  now  applied  to  fit  the  best  model.  The  results  are  shown  in  Table  2 below. 


Table  2:  Various  Statistics  of  ARIMA  Models  on  Original  Values  of  Opening  Price  of  SBI  Stock  at  BSE 


Model 

(1,0,0) 

(0,0,1) 

(1,0,1) 

(2,0,0) 

(0,0,2) 

SSE 

14198.1 

8E+06 

14184 

16382 

2557886 

MSE 

28.3962 

16872 

28.368 

32.764 

5115.77 

RMSE 

5.32881 

129.9 

5.3261 

5.724 

71.5246 

WN  Variance 

28.3962 

16872 

28.368 

32.764 

5115.77 

MAPE(Diff) 

1.46761 

50.39 

1.4674 

1.6061 

26.8252 

MAPE 

1.46761 

50.39 

1.4674 

1.6061 

26.8252 

-2Log  (Like.) 

3099.4 

6289 

3098.9 

3170.7 

5695.11 

FPE 

28.5101 

16872 

28.481 

33.027 

5115.77 

AIC 

3103.4 

6293 

3104.9 

3176.7 

5701.11 

AICC 

3103.42 

6293 

3104.9 

3176.8 

5701.16 

SBC 

3111.82 

6302 

3117.5 

3189.4 

5713.75 

Table3:  Various  Statistics  of  ARIMA  Models  for  First  & Second 
Difference  of  Opening  Price  of  SBI  Stock  at  BSE 
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AIC 

1 
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Interpretation 

It  can  be  inferred  from  the  results  obtained  from  table  3 that  ARIMA  (0,  1,  1)  is  the  best  fit  model  based  on  the 
values  of  FPE,  AIC,  AICC,  SBC  Tests.  In  all  these  tests  the  obtained  values  are  minimal.  It  is  quoted  by  T.  Ozaki  in  his 
paper  on  the  Order  Determination  of  ARIMA  Models  (1977) 


Table  4:  Descriptive  Statistics  of  Opening  Price  of  SBI  Stock  at  BSE 


Model 

Observations 

Minimum 

Maximum 

Mean 

Std.  Deviation 

(1,0,0) 

500 

147.00 

332.80 

252.71 

41.87 

(0,0,1) 

500 

147.00 

332.80 

252.71 

41.87 

(1,0,1) 

500 

147.00 

332.80 

252.71 

41.87 

(2,0,0) 

500 

147.00 

332.80 

252.71 

41.87 

(0,0,2) 

500 

147.00 

332.80 

252.71 

41.87 

(0,1,0) 

500 

-19.25 

31.40 

0.13 

5.33 

(1,1,0) 

500 

-19.25 

31.40 

0.13 

5.33 

(0,1,1) 

500 

-19.25 

31.40 

0.13 

5.33 

(2,1,0) 

500 

-19.25 

31.40 

0.13 

5.33 

(0,1,2) 

500 

-19.25 

31.40 

0.13 

5.33 

(1,1,1) 

500 

-19.25 

31.40 

0.13 

5.33 

(0,2,0) 

500 

-45.08 

30.10 

0.03 

7.66 

(1,2,0) 

500 

-45.08 

30.10 

0.03 

7.66 

(0,2,1) 

500 

-45.08 

30.10 

0.03 

7.66 

(1,2,1) 

500 

-45.08 

30.10 

0.03 

7.66 

(2,2,1) 

500 

-45.08 

30.10 

0.03 

7.66 

(1,2,2) 

500 

-45.08 

30.10 

0.03 

7.66 

Graphical  Analysis  if  the  Results  Obtained 

Moreover  on  observing  the  plot  of  the  time  series  values  over  time  to  determine  stationary  or  non-stationary, 
autocorrelation  function  (ACF)  of  the  sample  also  reflects  the  data. 

If  the  ACF  of  the  time  series  values  either  cuts  off  or  dies  down  fairly  quickly,  then  values  of  the  time  series 
should  be  considered  stationary. 

Secondly  if  the  ACF  of  the  time  series  values  either  cuts  off  or  dies  down  extremely  slowly  then  it  should  be 
considered  non-stationary. 
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Graphs  below  shows  the  series  is  stationary. 

Results  of  ARIMA  (0,  1, 1) 


Figure  4:  ARIMA-1D(0,1,1)  Figure  5:  ARIMA-1D(0,1,1) 
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Figure  6:  ARIMA-1D(0,1,1)  Figure  7:  ARIMA-1D(0,1,1) 


Figure  8:  ARIMA- ID  (0,1, 1)  Figure  9:  ARIMA-1D(0,1,1) 


V.  RESULTS  AND  DISCUSSIONS 


The  experimental  results  of  SBI  stock  index  are  discussed  in  as  below.  Table  5 is  the  result  of  the  predicted  values 
of  ARIMA  (0,  1,  1)  considered  the  best  model  for  SBI  stock  index.  Figure  1 Ogives  graphical  illustration  of  the  level 
accuracy  of  the  predicted  price  against  actual  stock  price  to  see  the  performance  of  the  ARIMA  model  selected.  From  the 
graph  it  is  obvious  that  the  performance  is  satisfactory. 

Table  5:  Sample  of  Empirical  Results  of  ARIMA  (0, 1, 1)  of  SBI  Stock  index 


Date 

Actual  Price  (in  Rs.) 

Predicted  Stock  Price 
through  ARIMA  (in  Rs.) 

12/17/2015 

230 

224.54 

12/18/2015 

229.95 

224.12 

12/21/2015 

226.9 

223.15 

12/22/2015 

231.65 

236.1 

12/23/2015 

230.25 

225.34 
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Table  5:  Contd. 


12/24/2015 

230.85 

223.89 

12/25/2015 

228.3 

227.12 

12/28/2015 

229.05 

226.74 

12/29/2015 

228.85 

234.23 

12/30/2015 

229 

227.56 

12/31/2015 

225.95 

228.12 

1/1/2016 

225 

223.24 

1/4/2016 

226.95 

225.12 

1/5/2016 

222.8 

229.12 

1/6/2016 

217.9 

215.45 

1/7/2016 

214 

217.12 

1/8/2016 

212 

214.54 

1/11/2016 

206.8 

203.54 

1/12/2016 

204.85 

200.87 

1/13/2016 

201.35 

203.23 

1/14/2016 

197 

198.96 

1/15/2016 

197.8 

199.87 

1/18/2016 

184.9 

186.4 

1/19/2016 

182.15 

183.32 

1/20/2016 

180.65 

178.12 

Figure  10:  Graph  of  Actual  Price  vs.  Predicted  Values  of  SBI  Stock  Index 


CONCLUSIONS 

This  paper  presents  extensive  process  of  building  ARIMA  model  for  stock  price  prediction.  The  experimental 
results  obtained  from  the  daily  data  of  State  Bank  of  India  (SBI)  with  best  ARIMA  model  demonstrated  the  potential  of 
ARIMA  models  to  predict  stock  prices  satisfactory  on  short-term  basis.  Box-Jenkins  methodology  of  developing  ARIMA 
models  is  used  for  forecasting  the  stock  prices.  ARIMA  models  have  shown  efficient  capability  to  generate  short-term 
forecasts.  It  constantly  outperformed  complex  structural  models  in  short-term  prediction.  This  could  guide  investors  in 
stock  market  to  make  profitable  investment  decisions.  With  the  results  obtained  ARIMA  models  can  compete  reasonably 
well  with  emerging  forecasting  techniques  in  short-term  prediction. 
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